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{57) Abstract 

A storage file system is provided for multimedia data, particularly for optical disc storage means. Each successive or alternate file 
(40, 46) of multimedia data included in the storage means is represented by a respective allocation extent list (42, 48) with each allocation 
extent (48.1) identifying storage means (44) location (2.1) for the files component fragments. Data may be shared by two or more files 
(40, 46) with each file having a respective allocation extent list entry referencing that segment and each such allocation extent maintaining 
byte-accurate pointers to the start and end of the relevant data within the segment. An allocation space table (Figure 4) containing respective 
entries for each reference to a whole or partial segment, and optionally also a free-space table (Figure 5) identifying those whole or partial 
fragments available for rewriting, may be provided to allow for rapid reclamation of free space within the storage medium. 
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1 

DESCRIPTION 

FILE SYSTEMS SUPPORTING DATA SHARING 

5 The present invention relates to methods and apparatus for data storage 

and retrieval, and to storage devices for such data - particularly, but not 
essentially, optical disc storage devices. 

Recent years have seen a great expansion in the connplexity of consumer 

10 electronics equipment with several different proprietary and technical standards 
governing interconnectivity and data storage. In connection with the latter 
feature, the domestic user has in the past used different mechanical and 
functional configurations of storage device, such as a VHS cassette for video 
recording, an audio compact cassette for audio recordings from Hi-Fi equipment, 

15 and hard and floppy discs for data storage on personal computers. With the 
advent of recordable optical discs conforming to unified standards as far as data 
layout, bit rates etc. are concerned, such discs may (if configured to the particular 
recording system) replace many of the disparate options, and hence the 
possibility of a single unified standard, both in terms of physical configuration and 

20 data management, may be contemplated for all types of domestic 
audio/video/data-processing systems. 

In conjunction with this increased commonality in the field of storage 
media, there is an increasing degree of requirement for sharing in the expected 
and actual functionalities of the devices themselves. Of particular relevance to 

25 the present invention is the digital recording of audio and/or video (AV) material, 
with the user coming to expect the same sort of facilities for arranging and editing 
of stored data that they experience from, for example, a personal computer. In 
terms of available devices, optical media were not particularly suited for video 
storage applications due to their hitherto limited storage capacity, although this is 

30 becoming less of an issue: for example, a disc according to DVD-ROM 
standards may store in the region of 8 hours of video compressed following 
MPEG2 protocols. Further developments are providing optical drives with a 
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capacity of tens of gigabytes per storage device (disc) and real-time video 
recording applications based around such storage devices are contemplated. 

Whilst many efficient schemes have been proposed for recording and 
storage of program data, it will be understood, however, that there are 

5 constraints to be observed in the storage of AV material which differ from those 
applied in the generation of efficient file structures and file handling in a purely 
data-oriented environment. Of particular relevance is the system-imposed desire 
for sections of video data (which may be interleaved with data for an 
accompanying soundtrack) to be stored contiguously such as to enable the 

10 encoded video data to be read fast enough to avoid presenting visible 
discontinuities to the viewer - for example to meet the applicable buffering 
constraints where the video data is MPEG encoded. 

Inefficient storage file structures can lead to problems in these areas and 
a number of strategies have been proposed for enhanced efficiency in 

15 multimedia data storage and retrieval, particularly for magnetic disc storage, as 
described in "Multimedia file systems survey: approaches for continuous media 
disk scheduling" by Ralf Steinmetz, Computer Communications Vol.18, No.3, 
March 1995 pp. 133-1 44. As is mentioned by Steinmetz, it is possible for data to 
be shared between files, a particular example of which would be different 

20 versions of a stored file (e.g. a censored children's version and an adult uncut 
version of a film) on a single disc with simple re-use of common sections of data 
where appropriate to avoid the need for storing full-length but only slightly 
different versions of a file. At present, however, insofar as the idea of re-usability 
of data has been concerned, it has been based on media where the data is 

25 subdivided into convenient uniform segments (for example sectors on a 
recordable disc) with sharing of file sections only being supported at the segment 
level. The result of this has been that either edited versions of a file are forced to 
include whole segments when only a small part thereof may have been desired, 
or the segment size has to be very small to permit flexibility in editing operations, 

30 which is both wasteful of disc space and creates an unacceptably high 
processing overhead. 
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It is therefore an object of the present invention to provide a storage 
means (and support means for the implementation of the same) with an internal 
data structure that supports multiple edits through re-use of whole or partial 
recording fragments/segments. 

It is a further object to provide such storage means utilising features of the 
internal data structure for the efficient reclamation of storage area containing 
redundant data in whole or partial fragments. 

In accordance with the present invention there is provided a storage 
device comprising a first area subdivided into segments for the recording of 
sequential data, portions of which when read in a predetermined sequence 
comprise a data file, the device further comprising a second area containing a 
list of allocation extents, each of which identifies the start and end of a 
contiguous part of the first area, and a pointer file containing a list of pointers 
to respective entries in the list of allocation extents; characterised in that the 
device carries at least two files defined in respective pointer files and at least a 
part of a segment is common to the two files, with each file having a respective 
allocation extent for that segment and each allocation extent indicating the 
start and end points within the segment for that part of the segment used in the 
respective files. By the provision of respective allocation extents for each 
usage of data from a segment, a simplification in the handling of multiple files 
containing shared data results. Furthermore, by having each allocation extent 
specify not just the particular segments but also the start and end points within 
those segments, the constraint for shared data having to be specified as a 
whole segment is removed, permitting greater flexibility in the possibilities for 
editing the stored data. 

The second area (which may be in the form of a plurality of discrete 
areas distributed amongst the segments of the first area) may further comprise 
an allocation space table containing an entry for each allocation extent, the 
table providing a record of all single or multiple uses of first area segments. 
With this table, the entries for which are created and/or updated with the 
allocation extents, it is not necessary for a device looking to delete or ovenA/rite 
segments or partial segments to scan all the allocation extents to identify 
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redundant segnnents, particularly where the space table is suitably arranged to 
provide a map of current first area usage. In this respect, the second area 
may further contain a free-space table the contents of which are derived from 
the contents of the allocation space table and identify all segments or parts of 

5 segments available for writing to. Although generation and maintenance of the 
free-space table requires extra processing capability in a hosting system, it 
does provide notable benefits in applications where large volumes are to be 
stored in various edited forms and free-space reclamation is a necessity. 

Each of the allocation extents may suitably identify a start point in terms 

10 of a number of bytes or other data subdivisions (e.g. MPEG2 data packs) to be 
ignored from the start of a segment, and/or an end point in terms of a number 
of bytes (or packs) to be ignored at the end of a segment. By specifying the 
start and end points as distances from an also specified segment start and 
end, backwards compatibility with older systems which only specify to a 

15 segment level is possible whilst retaining or maintaining compliance with block 
or pack-based encoding protocols such as MPEG. 

Whilst the storage medium may be a magnetic disc, in a preferred 
embodiment the storage device is an optical disc and the data written to 
segments of the first area comprises digitised audio and/or video material with 

20 the pointer files comprising respective playlists for all or selected parts of the 
material. This allows for example different versions of an AV presentation 
(such as different edits of a film) to be held on a single disc - perhaps coupled 
with system support for parental restriction on playback of some versions 
without requiring each version to be separately stored as a distinct and 

25 separate entity. Note that references herein to storage "in" and storage "on" a 
storage device or medium are used interchangeably, with neither being 
intended to refer to a specific form or configuration of storage device, unless 
explicitly stated. 

As a disc (optical or magnetic), all segments of the first area may be of 
30 a common size, and each of the first and second areas may comprise 
respective tracks or contiguous groupings of plural tracks. Note, however, that 
the division into first and second areas is necessary only from a logical point of 
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view, and the two areas may appear consecutively or adjacent in the chosen 
storage medium or may even be interleaved if handling for such an 
arrangement is within the capabilities of the read/write apparatus. Also, the 
respective pointer files for multiple files on a disc may be grouped together in a 

5 file table or may be distributed across the disc. 

Also in accordance with the present invention there is provided a 
method for formatting memory space in a storage device or medium, a data 
processing device operable to manipulate data in a first area of a storage 
medium or device formatted according to such a method, and such a 

10 processing apparatus further operable to implement the formatting method, all 
as described in the attached claims to which the readers attention is now 
directed and the disclosure of which is herein incorporated by reference. 

Preferred embodiments will now be described by way of example only, 
15 and with reference to the accompanying drawings in which: 

Figure 1 is a block schematic diagram of a data processing apparatus 
which may be configured to effect the formatting of the present invention; 

Figure 2 represents the relationship between a data file in the form of a 
playable object specified at database level to stored data fragments in a 
20 sequential recording medium; 

Figure 3 is similar to Figure 2 and shows a pair of playable objects sharing 
common data in respective fragments; 

Figure 4 represents an allocated space table for a pair of playable objects 
similar to those of Figure 3; and 
25 Figure 5 represents a free-space table derived from the allocated space 

table of Figure 4. 

Beginning with Figure 1, there are shown the basic components of a data 
processing apparatus such as may be used for the formatting and storing of data 
30 on an optical disc 10. The apparatus consists of a central processor (CPU) 12 
coupled with random-access (RAM) 14 and read-only (ROM) 16 memory 
devices via an address and data bus 18. An external interface (EXT l/F) 20 
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represents the apparatus connection with external data sources. As will be 
recognised, the configuration of this interface will be dependent on the type of 
external data source and the overall function of the data processing apparatus: 
for example, where the apparatus is a domestic video/audio recorder, the 
5 interface will provide the connection and reception means for the source of 
video/audio signals to be recorded (e.g. from a satellite receiver) or, where the 
apparatus is a personal computer, it may comprise a link to remote data sources 
by way of, for example, an Internet interface. 

Also coupled with the CPU 12, memories 14, 16, and interface 20 via the 
10 bus 18 are one or more user input means (UIP) 22 and a display 24; for a PC- 
based apparatus, these devices may comprise respectively a keyboard and 
monitor, whereas for a domestic recorder apparatus they may comprise user 
control buttons and an L.E.D display on the apparatus front panel. A further 
component is an interface to a storage medium, in this example an optical disc 
15 record/playback unit (DISC RA/V) 26 providing both the physical means to 
receive and read from/write to optical disc 10 and an internal set of operational 
protocols for reading/writing from/to a disc formatted according to a 
predetennined standard. As will be well understood, the protocol handling of the 
unit 26 may be effected by an internal slave processor with associated memory 
20 (not shown) under control of CPU 12, or these functions may be handled directly 
by CPU 12 with reference to protocol commands held at bootstrap level in ROM 
16 or periodically reloaded to RAM 14. 

In Figure 1, the optical disc 10 is shown with a nominal division into a first 
area 28 carrying a sequence of full, free, or partially written data segments, and a 
25 second area 30 carrying a number of control structures (file tables, allocation 
extent lists, free-space structures etc.) the function of which will be described in 
detail below. As previously stated, the division into discrete first and second 
areas is necessary only from a logical point of view, to distinguish between the 
pure data carrying and functional on-disc structures. In practical terms, although 
30 they may be distinct entities recorded on separate track areas of a disc (for 
example), for an operational embodiment savings in terms of reduced seek 
delays (and hence reduced access times) may be obtained through distributing 
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the second area amongst the first, for example with file and allocation extent 
tables describing whole and partial disc segment usage being placed on disc 
adjacent those segments referred to. 

Figure 2 illustrates the relationship between a data file entry (also referred 

5 to as a playable object) specified at database level and those individual 
fragments of data stored at various locations on disc which are to be replayed in 
the correct sequence to recreate the data file. Each file on the disc is 
represented by an entry (FE.1) 40 in a file table written to the second area. At 
database level, this file entry contains data about the file as a whole (its name, 

10 date of creation etc.) or where the file is in multiple sequential parts, the entry will 
uniquely identify the part and its place in the sequence. At the system level, this 
entry 40 references a list 42 containing one or more allocation extents AX 42.1, 
42.2..,42.n each of which identifies the start and end of a respective contiguous 
section (1.1, 1.2,...1.n) of linear storage, as represented by the sequence of 

15 storage areas 44 in the lower part of the Figure. 

In known manner, for ease of accessing, the linear storage 44 may be 
divided into a sequence of commonly-sized or commonly specified logical 
sectors or segments, the size of which may be set to a fixed number of bytes 
(such as 2048) or a predetermined other grouping of data such as encoded 

20 MPEG blocks. Note that the physical ordering of used segments need not be 
reflected in the physical order in which the data they carry is to be replayed. 
However, proximity of successive portions of a file is preferred as this minimises 
delays due to seek times and reduces the likelihood of problems with buffering 
constraints for time-critical data such as MPEG audio/video. 

25 Each of the allocation extents AX 42.1, 42.2...42.n specifies the respective 

portion of data 1.1, 1.2...1.n both in terms of the start and end points for the 
segment or segments that contain it, to enable backwards compatibility with 
systems that only specify to whole-segment granularity, and in terms of the 
number of bytes (or other arbitrary block divisions) that are to be ignored at the 

30 start and/or end of each segment. 

In Figure 3, a second file entry (FE.2) 46 is shown added which file entry 
has a respective list 48 of allocation extents AX 48.1, 48.2 each identifying 
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respective portions 2.1 , 2.2 of the linear sequence of segments 44. This second 
file sequence may comprise a distinct entity from the first 40, or it may be a 
subsequent but sequential part of a larger file specified by sequential entries in 
the file table. In either case, the point to note is the re-use of a part of the stored 
5 data in those segments where portions 1 .2 and 2.1 overlap. 

Figure 4 represents an additional structure, allocation space table 50, 
stored in the second area. A further allocation extent AX 48.3, physically and 
temporally contiguous with its predecessor 48.2 from the second file entry (FE.2) 
46 is also added. In general, each allocation extent will have a respective entry 
10 in the allocation space table 50 (42.1 = ASE1 = 50.1 ; 42.2 = ASE2 = 50.2; 42.n 
= ASEn = 50.n; 48.1 = ASEn+1 = 50.n+1). The exception to this is where there 
are pairs of contiguous, non-overlapping portions, as specified by allocation 
extents 48.2 and 48.3 when, for reasons of both data saving and efficiency, a 
single allocation space entry (ASEn+2) 50.n+2 is written to table 50. 
15 The allocation space table 50 provides, in a single location, an indication 

of the usage (full, partial or empty) for each segment (and hence each byte) of 
the first area, which table is updated each time new data, or a new edit of 
existing data, is created in the storage medium. For ease of creation, as shown 
the entries to the table may be written in file order - that is to say the usage for 
20 each allocation extent of the first file is listed before the usage for each allocation 
extent of the second file and so on. For enhanced ease of use, for example in 
systems seeking free segments or parts of segments for the interleaving of 
ancillary data to accompany a stored video sequence, the allocation space table 
may be arranged in sequential order of the specified start points regardless of 
25 which file the particular portion is used in, such that scanning to find localised 
free space does not require the checking of every entry in the list 50. Whilst this 
will require some additional processing when creating multiple edits of a block of 
data stored over many segments, handling the list 50 as a stack (and offloading 
to read/write apparatus RAM for updating) is not problematical. Furthermore, the 
30 benefits when seeking to reclaim redundant storage area, perhaps for ancillary 
data use or for further AV data as the disc/storage device becomes full, are 
notable. 



wo 00/23871 PCT/EP99/07451 

9 

An extension to the above-described facilitation for free-space 
reclamation is sliown in Figure 5 wliere, in addition to (or in place of) the table of 
allocation space data, a free-space table 52 is provided in the second area. 
Rather than comprehensively detailing file use of segments and parts of 

5 segments, the free-space table is derived from the allocation space data and 
lists, suitably as start and end points and In sequential physical order, those 
portions FSE 52.1 - 52.4 of the first area that are unused or, through a 
subsequent editing of stored material, those portions containing data 
unreferenced by any allocation extent which data is therefore (notionally) free for 

10 overwriting subject to any constraints on keeping data available for the creation 
of further files. With both an allocation space table and a free-space table stored, 
the allocation space table may be written sequentially on the fly, as in Figure 4, 
as the respective allocation extents are created, with the updating of the 
disc/storage media free-space list 52 (which may be a more lengthy process) 

15 being reserved to a housekeeping procedure for handling in "idle" moments for 
the writing system processor and/or at the conclusion of the edit creation. 

In terms of industrial application of the foregoing storage mechanism, a 
particular application is in disc-based video recorders where the amount of video 
material to be stored is relatively large: even compressed video material takes up 

20 a lot of storage space (of the order of 4Mb for 3 seconds of video) and capacity 
on disc-based storage media - particularly optical discs - is limited. Such a disc- 
based video recorder preferably supports some level of user-operated editing 
capability, where the editing application supported utilises parts of already- 
recorded video material. By sharing data between two recordings, a saving in 

25 disc capacity is achieved and, in circumstances such as multiple edits of a movie 
being provided on a single disc, the overall usage may be little more than that for 
a single edit of the movie. If recordings are, or consist of, data files, then sharing 
parts of a file in the underlying file system provides an efficient solution for 
sharing data between recordings. 

30 From reading the present disclosure, other variations will be apparent to 

persons skilled in the art. Such variations may involve other features which are 
already known in the methods and apparatuses for data management and 
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storage and component parts thereof and which may be used instead of or in 
addition to features already described herein, and the scope of the present 
invention is to be determined by the claims appended hereto. 
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CLAIMS 

1. A storage device comprising a first area subdivided into 
segments for the recording of sequential data, portions of which when read in 
5 a predetermined sequence comprise a data file, the device further comprising 
a second area containing a list of allocation extents, each of which identifies 
the start and end of a contiguous part of the first area, and a pointer file 
containing a list of pointers to respective entries in the list of allocation extents; 

characterised in that the device carries at least two files defined in 
10 respective pointer files and at least a part of a segment is common to the two 
files, with each file having a respective allocation extent for that segment and 
each allocation extent indicating the start and end points within the segment 
for that part of the segment used in the respective files. 

15 2. A storage device as claimed in Claim 1, wherein said second 

area further comprises an allocation space table containing an entry for each 
allocation extent, the table providing a record of all single or multiple uses of 
first area segments. 

20 3. A storage device as claimed in Claim 2, wherein said second 

area further contains a free-space table the contents of which are derived from 
the contents of the allocation space table and identify all segments or parts of 
segments available for writing to. 

25 4. A storage device as claimed in any of Claims 1 to 3, wherein 

each allocation extent identifies a start point in terms of a number of bytes to 
be ignored from the start of a segment. 

5. A storage device as claimed in any of Claims 1 to 4, wherein 
30 each allocation extent identifies an end point in terms of a number of bytes to 
be ignored at the end of a segment. 
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6. A storage device as claimed in any of Claims 1 to 5, wherein the 
storage device is an optical disc and the data written to segments of the first 
area comprises digitised audio and/or video material with the pointer files 
comprising respective playlists for all or selected parts of the material. 

5 

7. A method for formatting memory space in a storage medium to 
be subsequently accessed by a data reading apparatus, comprising the steps 
of: 

partitioning a first area of the memory space into a plurality of segments 
10 for the recording of sequential data, portions of which when read in sequence 
comprise a data file; 

generating and writing to a second area of the memory space a list of 
allocation extents, each of which identifies the start and end of a contiguous 
part of the first area; 

15 generating and writing to the second area of the memory space a 

pointer file as a list of pointers to respective entries in the list of allocation 
extents; 

characterised in that the storage medium contains at least two files with 
at least a part of a segment being common to the two files, wherein a 
20 respective pointer file is generated and written for each file and a respective 
allocation extent is generated for each use of a segment, those allocation 
extents including an indication of the physical locations of the start and end 
points within the segment for that pari: of the segment used in the respective 
files. 

25 

8. A method as claimed in Claim 7, further comprising the step of 
generating and writing to the second area an allocation table, the table 
containing a record of single or multiple uses of all first area segments. 



30 9. A method as claimed in Claim 8, further comprising the step of 

generating and writing to the second area a free-space table the contents of 



wo 00/23871 



13 



PCT/EP99/07451 



which are derived from the allocation space table and identify all segments or 
parts of segments available for writing to. 

10. A data processing apparatus operable to manipulate data in the 
first area of a storage means formatted by the method of Claim 7, the 
apparatus comprising a data processor coupled with storage medium 
read/write means configured to receive and access said storage medium, and 
an input to receive data identifying those segments or parts of segments 
comprising a file, the processor being arranged to generate and store a pointer 
file together with a list of allocation extents for the segment data, those 
allocation extents including an indication of the physical locations of the start 
and end points within each segment used in a file of the segment data for that 
file. 

11. Apparatus as claimed in Claim 10, being further operable to 
implement the formatting method of Claim 7, wherein the processor is 
configured to partition the memory space of a received storage device into said 
first and second areas and to further partition said first area into said plurality 
of segments. 

12. Apparatus as claimed in Claim 10 or 1 1 , wherein the processor is 
further configured to periodically update, or to generate and periodically 
update, an allocation space table in said second area which table contains an 
entry for each allocation extent and provides a record of single or multiple file 
uses of all first area segments. 

13. Apparatus as claimed in Claim 12, wherein the processor is 
further configured to periodically update, or to generate and periodically 
update, a free-space table in said second area the contents of which table are 
derived by the processor from the contents of the allocation space table and 
provide a record for those first area segments or parts of segments available 
for writing to. 
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14. Apparatus as claimed in any of Claims 10 to 13, wherein the 
storage means is a writable optical disc and the read/write means is configured 
accordingly, the data written to the segments in the first area thereof 
5 comprising digitised audio and/or video material with the pointer files being 
read as respective playlists for all or selected parts of the material. 
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